From Character to Word Level: Enabling the Linguistic Analyses of Inputlog Process Data
Abstract
Keystroke-logging tools are widely used in writing process research. These applications capture each keystroke and mouse movement as an isolated event, which researchers treat as an indicator of underlying cognitive processes. The current research project explores the possibilities of aggregating the logged process data from the letter level (keystroke) to the word level by merging them with existing lexica and using NLP tools. This linkage enables researchers to analyze the data on a higher, more complex level. In this project the output data of Inputlog are segmented on the sentence level and then tokenized. However, writing process data by definition do not always represent clean and grammatical text, and coping with this problem was one of the main challenges in the current project. A parser has therefore been developed that extracts three types of data from the S-notation: word-level revisions, deleted fragments, and the final writing product. Within-word typing errors are identified and excluded from further analyses. At this stage the Inputlog process data are enriched with the following linguistic information: part-of-speech tags, lemmas, chunks, syllable boundaries, and word frequencies.
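To illustrate the kind of extraction the abstract describes, the following is a minimal sketch and not the project's actual parser. It assumes a simplified, non-nested S-notation in which a deletion is written as `[text]N`, an insertion as `{text}N`, and a break position as `|N`. The function name `parse_s_notation` and the sample string are hypothetical.

```python
import re

# Assumed simplified S-notation (non-nested):
#   [text]N  -> text removed in revision N
#   {text}N  -> text inserted in revision N
#   |N       -> position where the writer broke off before revision N
BREAK = re.compile(r"\|\d+")
DELETION = re.compile(r"\[([^\[\]]*)\]\d+")
INSERTION = re.compile(r"\{([^{}]*)\}\d+")


def parse_s_notation(s_notation: str) -> dict:
    """Return deleted fragments, the final text, and its whitespace tokens."""
    # Drop break markers first so they do not leak into extracted fragments.
    text = BREAK.sub("", s_notation)
    # Collect the content of every deletion before removing it.
    deleted = [match.group(1) for match in DELETION.finditer(text)]
    text = DELETION.sub("", text)
    # Insertions survive into the final product, so keep only their content.
    text = INSERTION.sub(r"\1", text)
    # Collapse the double spaces left behind by removed fragments.
    final = re.sub(r"\s+", " ", text).strip()
    return {"deleted": deleted, "final": final, "tokens": final.split()}


if __name__ == "__main__":
    sample = "The [quikc]1|1 quick {brown }2fox|2 jumps."
    print(parse_s_notation(sample))
```

Real S-notation allows nested revisions, which a regular-expression approach like this one cannot handle; a stack-based parser would be needed for that case. The whitespace tokenization is likewise only a stand-in for the sentence segmentation and tokenization steps mentioned in the abstract.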
Similar resources
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Keystroke logging tools are a valuable aid to monitor written language production. These tools record all keystrokes, including backspaces and deletions together with timing information. In this paper we report on an extension to the keystroke logging program Inputlog in which we aggregate the logged process data from the keystroke (character) level to the word level. The logged process data ar...
Writers on the Move: Visualizing Composing Processes Involved in Academic Writing
The present study aimed to explore the covert processes of editing and revision involved in writing four different academic text genres (i.e. abstract, conclusion, data commentary, and cover letter) in English. To this end, six EFL learners with Persian as their mother tongue were recruited to participate in this study. All the participants attended an induction session and ea...
The Intellectual Structure of Knowledge in the Field of Distance Education Using the Co-Word analyses
Background: Co-word analysis is one of the content analysis methods used in scientometric studies and in mapping the scientific structure of various fields. The purpose of the present research is to map the structure of distance education using co-word analysis. Methods: The research method is content analysis using co-word analysis. The research population consists of 31607 documents indexed in the...
The Effect of Word Meaning on Speech DysFluency in Adults with Developmental Stuttering
Objectives: Stuttering is one of the most prevalent speech and language disorders. The symptomatology of stuttering has been surveyed from different aspects, such as biological, developmental, environmental, emotional, learning, and linguistic. Previous research on English-speaking people has suggested that some linguistic features such as word meanings may play a role in the frequency of speech non...
A Linguistic Account of the Protagonist’s Development in the Grapes of Wrath
The novel as a modern literary genre is generally regarded as the realization of its main character's journey from immaturity to a status of maturity. The character, usually an uncomplicated person unable to cope with the complexities of life at first, gains an insight and understanding to handle his/her complex situation accordingly later in the novel. It is usually agreed in both literary cri...